SPECIFICATION

Multichain polypeptides or proteins and processes for their production 

This invention relates to multichain polypeptides or proteins and processes for their production
in cells of host organisms which have been transformed by recombinant DNA techniques.

In recent years advances in molecular biology based on recombinant DNA techniques have
provided processes for the production of heterologous (foreign) polypeptides or proteins in host
cells which have been transformed with heterologous DNA sequences which code for the
production of these products.

Theoretically, the recombinant DNA approach may be applied to the production of any
heterologous polypeptide or protein in a suitable host cell, provided that appropriate DNA
coding sequences can be identified and used to transform the host cell. In practice, when the
recombinant DNA approach was first applied to the production of commercially useful products,
its application for the production of any specified polypeptide or protein presented particular
problems and difficulties, and the success of applying this approach to the production of any
particular polypeptide or product was not readily predictable.

However, a large number of heterologous single chain polypeptides or proteins have now
been produced by host cells transformed by recombinant DNA techniques. Examples of such
heterologous single chain polypeptides or proteins include human interferons, the A and B
chains of human insulin, human and bovine growth hormone, somatostatin, calf prochymosin
and urokinase. Such transformed host cells provide a reproducible supply of authentic
heterologous polypeptide or protein which may be produced on an industrial scale using
industrial fermentation technology.

It should be pointed out that some of these polypeptides, for instance urokinase, after
secretion by a host cell appear as two chain molecules. However, in such cases, the molecule is
synthesised by the host cell as a single chain polypeptide, coded for by a single DNA sequence,
which is cleaved in the host cell subsequent to synthesis to form the two chain structure.

It is known that in both human and animal systems there are a number of polypeptides or
proteins which have multichain structure in which the chains are not derived from the cleavage
of a single chain polypeptide coded for by a single DNA sequence. In such cases, the gene for
each of the chains may be located at different points on the same chromosome or even on
different chromosomes. In these cases, the polypeptide chains are synthesised separately and
then assembled into the complete molecule subsequent to synthesis. Heretofore, no such
multichain polypeptide or protein has been produced by recombinant DNA techniques from a
single host cell.

A particular example of a class of such multichain polypeptides or proteins is the
immunoglobulins.

Immunoglobulins, commonly referred to as antibodies, are protein molecules produced in
animals by B-lymphocyte cells in response to challenge with foreign antigenic agents, such as
bacteria, viruses and foreign proteins. The immunoglobulins comprise a crucial part of the
immune systems of humans and animals. The immunoglobulins recognise specific parts of the
foreign agents and bind onto them. The specific parts are usually known as antigenic
determinants or antibody binding sites. A given foreign agent is likely to have a number of
different antigenic determinants.

A typical immunoglobulin (Ig) molecule is shown in Fig. 1 of the accompanying drawings, to
which reference is now made. The Ig molecule comprises two identical polypeptide chains of
about 600 amino acid residues (usually referred to as the heavy chains H), disulphide bonded to
each other, and two identical shorter polypeptide chains of about 220 amino acid residues
(usually referred to as the light chains L), each light chain being disulphide bonded to one end
of each heavy chain as shown.

When the Ig molecule is correctly folded, each chain is formed into a number of distinct
globular areas, usually known as domains, joined by a more linear polypeptide chain. The light
chains have two such domains, one of which is of variable sequence VL, and the other of which
is of constant sequence CL. The heavy chains have a single variable domain VH adjacent the
variable domain VL of the light chain, and three or four constant domains CH1-3 or 4.

The variable domains on both the heavy and the light chains each contain three hypervariable
regions (HV1-3) which, when the Ig molecule is correctly folded, are located adjacent one
another and form the antigen binding site. It is this area which recognises and binds to the
antigenic determinant for which the Ig molecule is specific.

The constant domains of the Ig molecule do not take part in the binding to the antigenic
determinant, but mediate the actions triggered by the binding of the Ig molecule to the
antigenic determinant. It is believed that this triggering is caused by an allosteric effect induced
by the binding of the Ig molecule to the antigenic determinant. The constant domain may
enable the Ig molecule to fix complement or may cause mast cells to release histamine.
Ig's may be categorised by class or subclass, depending on which of a number of possible
heavy chain constant domains they contain, there being eight such possible heavy chains in
mice. Thus, for instance, an Ig molecule with a μ heavy chain belongs to the class IgM, and one
with a γ1 heavy chain to the class IgG1.

Ig's may also contain one of two light chains, designated as κ and λ light chains, which have
different constant domains and different sets of variable domains.

The structure of the Ig molecule and the location of the genes coding for the various domains 
thereof are discussed more fully by Early and Hood, in Genetic Engineering, Principles and 
Methods, Vol.3, pages 153-158 (edited by Setlow and Hollaender, Plenum Press). 

It is known that Ig molecules on digestion with selected enzymes can produce a number of
immunologically functional fragments. Two such fragments are known as the Fab and (Fab')2
fragments. The Fab fragment comprises one light chain linked to the VH and CH1 domains of a
heavy chain as shown in Fig. 1. The (Fab')2 fragment consists essentially of two Fab fragments
linked together by a small additional portion of the heavy chains as shown in Fig. 1. These
fragments and other similar fragments can be of use in various tests and diagnostic and medical
methods.

The principal method employed for the production of Ig's involves the immunisation of
susceptible animals with the antigenic agent to provide an immune reaction. Generally the
animal is immunised a second time to improve the yield of Ig. The animal is then bled and the
Ig is recovered from the serum.

However, the product of this method is not a homogeneous protein. The animal will produce
Ig of different classes and also Ig specific for each of the different antigenic determinants on the
antigenic agent, and its blood will therefore contain a heterogeneous mixture of Ig's. Obtaining a
specific Ig of particular class and desired specificity from such a mixture requires very difficult
and tedious purification procedures.

Recently, it has become possible to produce a homogeneous Ig of a single class and a single
specificity by a technique first described by Kohler and Milstein (Nature, 256, 495-497,
1975). The technique involves the fusion of single Ig-producing parent cells with cancer cells to
produce a monoclonal hybridoma cell which produces the Ig. Ig produced by this technique is
usually known as monoclonal antibody. The nature of the monoclonal Ig is determined by the
class and specificity of the Ig produced by the parent cell.

Recently, attempts have been made to use recombinant DNA techniques to produce
fragments of Ig molecules. For instance, Amster et al. (Nucleic Acids Research, 8, No. 9, 1980,
pp 2055 to 2065) disclose the cloning of double stranded cDNA sequences encoding a
mouse Ig light chain into a plasmid. An E. coli strain transformed by the plasmid synthesised a
protein thought to comprise the complete constant domain of the light chain and about 40
amino acid residues of its variable region.

Kemp and Cowman (Proc. Natl. Acad. Sci. USA, 78, 1981, pp 4520 to 4524) disclose the
cloning of cDNA sequences encoding mouse heavy chain fragments and the transforming of
an E. coli strain which then synthesised heavy chain polypeptide fragments.

In both these cases, the polypeptides were produced as fusion proteins, in which the
fragments of the Ig polypeptides were fused with additional non-Ig polypeptide sequences, and
with incomplete variable domains. Thus, the polypeptide chains produced in these studies were
not immunologically functional polypeptides as they were incapable of combining with
complementary heavy or light chains to provide Ig molecules having intact antigen binding sites and
immunological function.

Research studies have also been carried out in mammalian systems. For instance, Falkner and
Zachau (Nature, 298, 1982, pp 286 to 288) report the cloning of cDNA sequences encoding
mouse light chains into a plasmid which was used to transfect genomic eukaryotic cells
which could then transiently synthesise light chains.

Rice and Baltimore (Proc. Natl. Acad. Sci. USA, 79, 1982, pp 7862 to 7865) report on the
transfection of a functionally rearranged κ light chain Ig gene into a murine leukemia
virus-transformed lymphoid cell line. The cell line is then able to express the gene continuously. In
both these cases, the κ genes used to transfect the mammalian cells were obtained from
myeloma cells and the κ polypeptides produced were of indeterminate immunological function.

A further approach is exemplified in a series of papers by Valle et al. (Nature, 291, 1981,
338-340; Nature, 300, 1982, 71-74; and J. Mol. Biol., 160, 1982, 459-474), which
describe the microinjection of mRNAs encoding heavy or light chains of Ig isolated from a
mouse myeloma line into oocytes of Xenopus laevis. Under certain conditions complete Ig
molecules were formed. However, the mRNAs were obtained from myeloma cells and the Ig
molecules were of indeterminate immunological function.

It can thus be seen that hitherto it has not been possible to produce functional Ig by 
recombinant DNA technology. 
According to a first aspect of the present invention, there is provided a process for producing
a heterologous multichain polypeptide or protein in a single host cell, which comprises
transforming the host cell with DNA sequences coding for each of the polypeptide chains and
expressing said polypeptide chains in said transformed host cell.

According to a second aspect of the present invention, there is provided a heterologous
multichain polypeptide or protein produced by recombinant DNA technology from a single host
cell.

The present invention is of particular, but not exclusive, application in the production of Ig
molecules and immunologically functional Ig fragments of the type referred to above. However,
it will be appreciated that the present invention can be applied to the production of other
multichain polypeptides or proteins.

In relation to the production of Ig molecules according to the invention it will be appreciated
that, in order to produce a functional molecule, the DNA sequences used to transform the host
cell will need to encode at least the VL and VH domains of an Ig molecule. Moreover, these
domains will need to be complementary so that when the two polypeptide chains fold together
they form an antigen binding site of predetermined specificity.

Preferably, the Ig molecule or fragment includes a complete light chain and at least the CH1
domain in addition to the VH domain of the heavy chain. Most preferably the Ig molecule is
intact.

It has also been shown by the present applicants that it is now possible to produce individual
heavy and light chains having intact variable domains. This has not previously been possible.
Therefore, according to a third aspect of the present invention there is provided as a product of
recombinant DNA technology an Ig heavy or light chain or fragment thereof having an intact
variable domain.

Advantageously, the Ig molecule or functional fragment thereof according to the present
invention has a variable region (formed by the VL and VH domains) which defines a binding site
for an antigenic determinant of clinical or industrial importance. The DNA coding sequences
necessary to produce such a molecule may be derived from naturally occurring or hybridoma
(monoclonal) Ig-producing cells with the desired specificity.

The constant domains of the Ig molecule or fragment, if present, may be derived from the
same cell line as the variable region. However, the constant domains may be specifically altered,
partially or completely omitted, or derived from a cell line producing a different class of Ig to
provide Ig molecules or fragments having desired properties.

For example, an Ig molecule may be produced having variable domains (VH and VL) identical
with those from a monoclonal antibody having a desired specificity, and constant domain(s)
from a different monoclonal antibody having desired properties, for instance to provide human
compatibility or to provide a complement binding site.

Such alterations in the amino acid sequence of the constant domains may be achieved by
suitable mutation or partial synthesis and replacement or partial or complete substitution of
appropriate regions of the corresponding DNA coding sequences. Substitute constant domain
portions may be obtained from compatible recombinant DNA sequences.

The invention may be utilised for the production of Ig molecules or fragments useful for
immunopurification, immunoassays, cytochemical labelling and targetting methods, and
methods of diagnosis or therapy. For example, the Ig molecule or fragment may bind to a
therapeutically active protein such as interferon or a blood clotting factor, for example Factor
VIII, and may therefore be used to produce an affinity chromatography medium for use in the
immunopurification or assay of the protein.

It is also envisaged that the Ig molecule may be synthesised by a host cell with another
peptide moiety attached to one of its constant domains. Such a further peptide moiety may be
cytotoxic or enzymatic. Alternatively, the moiety may be useful in attaching the Ig molecule to a
biological substrate, such as a cell or tissue, or to a non-biological substrate, such as a
chromatography medium. Such a peptide moiety is herein referred to as a structural peptide
moiety.

It is further envisaged that cytotoxic, enzymic or structural peptide moieties could be attached
to the Ig molecule by normal peptide chemical methods, as are already known in the art, rather
than by being synthesised with the Ig molecule.

The Ig molecule or fragment may also comprise a therapeutic agent in its own right. For
instance, an Ig molecule or fragment specific for D blood group antigen may be useful for the
prevention of haemolytic disease of the new born.

Any suitable recombinant DNA technique may be used in the production of the multichain
polypeptides or proteins of the present invention. Typical expression vectors such as plasmids
are constructed comprising DNA sequences coding for each of the chains of the polypeptide or
protein.

It will be appreciated that a single vector may be constructed which contains the DNA
sequences coding for more than one of the chains. For instance, the DNA sequences coding for
Ig heavy and light chains may be inserted at different positions on the same plasmid.
Alternatively, the DNA sequence coding for each chain may be inserted individually into a
plasmid, thus producing a number of constructed plasmids, each coding for a particular chain.
Preferably the plasmids into which the sequences are inserted are compatible.

The or each constructed plasmid is used to transform a host cell so that each host cell 
contains DNA sequences coding for each of the chains in the polypeptide or protein. 

Suitable expression vectors which may be used for cloning in bacterial systems include
plasmids, such as Col E1, pcR1, pBR322, pACYC 184 and RP4, phage DNA or derivatives of
any of these.

For use in cloning in yeast systems, suitable expression vectors include plasmids based on a 2 
micron origin. 

Any plasmid containing an appropriate mammalian gene promoter sequence may be used for
cloning in mammalian systems. Such vectors include plasmids derived from, for instance,
pBR322, bovine papilloma virus, retroviruses, DNA viruses and vaccinia viruses.

Suitable host cells which may be used for expression of the heterologous multichain
polypeptide or protein include bacteria, such as E. coli and B. subtilis, streptomyces, yeasts,
such as S. cerevisiae, and eukaryotic cells, such as insect or mammalian cell lines. Examples of
suitable bacterial host cells include E. coli HB101, E. coli χ1776, E. coli χ2882, E. coli PS
410, E. coli MRC 1, E. coli RV308, E. coli E103S and E. coli B.

The present invention also includes constructed expression vectors and transformed host cells
for use in producing the multichain polypeptides or proteins of the present invention.

After expression of the individual chains in the same host cell, they may be recovered to
provide the complete multichain polypeptide or protein in active form, for instance to provide an
Ig molecule of predetermined immunological function.

It is envisaged that in preferred forms of the invention, the individual chains will be processed
by the host cell to form the complete polypeptide or protein which advantageously is secreted
therefrom.

However, the individual chains may be produced in insoluble or membrane-bound
form. It may therefore be necessary to solubilise the individual chains and allow the
chains to refold in solution to form the active multichain polypeptide or protein. A suitable
procedure for solubilising polypeptide chains expressed in insoluble or membrane-bound form is
disclosed in our copending application No. (Protein Recovery, Agent's Ref. GF 402120 and
402121).

It will be appreciated that the present application shows for the first time that it is possible to
transform a host cell so that it can express two or more separate polypeptides which may be
assembled to form a complete multichain polypeptide or protein. There is no disclosure or
suggestion of the present invention in the prior art, which relates solely to the production of a
single chain heterologous polypeptide or protein from each host cell.

The present invention will now be described by way of example only, with reference to the
accompanying drawings, in which:

Figure 1 shows a diagrammatic representation of a typical intact Ig molecule;

Figure 2 shows the construction of plasmids for the direct synthesis of a λ light chain in E. coli;

Figure 3 shows the construction of plasmids for the direct synthesis of a μ heavy chain in E. coli;

Figure 4 is a diagrammatic representation of μ mRNA sequences around the initiation codon;

Figure 5 shows the construction of plasmids having altered secondary structure around the
initiation codon;

Figure 6 is a polyacrylamide gel showing expression and distribution of μ protein from E. coli B;

Figure 7 is a polyacrylamide gel showing pulse chase autoradiograms of μ protein in E. coli B
and in E. coli HB101;

Figure 8 is a polyacrylamide gel showing the results of λ gene expression in E. coli;

Figure 9 is a polyacrylamide gel showing the distribution of recombinant λ light chain
polypeptide between the soluble and insoluble cell fractions;

Figure 10 is a polyacrylamide gel showing expression and distribution of λ protein from
E. coli E103S;

Figure 11 shows the results of the fractionation of μ and λ protein expressed by E. coli B on
DEAE Sephacel;

Figure 12 shows the specific hapten binding of reconstituted Ig molecules; and

Figure 13 shows the heteroclitic nature of the hapten binding of the reconstituted Ig
molecules.

In the following examples, there is described the production of Ig light and heavy chain
polypeptides derived from monoclonal antibodies which recognise and bind to the antigenic
determinant 4-hydroxy-3-nitrophenyl acetyl (NP), using E. coli and S. cerevisiae as the host cells.
Recombinant DNA techniques were used to enable the host cells to express both the
polypeptide chains.

It will be appreciated that the invention is not limited to the specific methods and construction
described hereafter.

Construction of Lambda Light Chain Expression Plasmid 

Fig. 2, to which reference is now made, shows schematically the method used to construct a
λ light chain expression plasmid.

It was decided to express the lambda gene in E. coli by direct expression of the gene lacking
the eucaryotic signal peptide but containing a methionine initiator residue at the amino-terminus
(met-lambda). The approach used for bacterial synthesis of met-lambda was to reconstruct the
gene in vitro from restriction fragments of a cDNA clone and to utilise synthetic DNA fragments
for insertion into the bacterial plasmid pCT54 (Emtage et al., Proc. Natl. Acad. Sci. USA, 80,
3671 to 3675, 1983). This vector contains the E. coli trp promoter, operator and leader
ribosome binding site; in addition, 14 nucleotides downstream of the ribosome binding site is an
initiator ATG followed immediately by EcoRI and HindIII sites and the terminator for E. coli
RNA polymerase from bacteriophage T7.

As a source of light chain we used a plasmid pABλ1-15 which contains a full-length λ1 light
chain cDNA cloned into the PstI site of pBR322. This λ1 light chain is derived from a
monoclonal antibody, designated S43, which binds to 4-hydroxy-3-nitrophenylacetyl (NP)
haptens.

In order to create a HindIII site 3' to the end of the lambda gene for insertion into the HindIII
site of pCT54, the cDNA was excised from pABλ1-15 using PstI. The cohesive ends were
blunt ended using the Klenow fragment of DNA polymerase and synthetic HindIII linker
molecules of sequence 5'-CCAAGCTTGG-3' ligated. The DNA was digested with HindIII and the
850bp lambda gene isolated by gel electrophoresis and cloned into HindIII cut pAT153 to yield
plasmid pATλ1-15. The 3' end of the lambda gene was isolated from pATλ1-15 by HindIII
plus partial SacI digestion as a 630bp SacI-HindIII fragment (2 in Fig. 2). The HindIII cohesive
end was dephosphorylated by calf intestinal alkaline phosphatase during isolation of the
fragment to prevent unwanted ligations at this end in subsequent reactions.
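
The properties that make the synthetic linker 5'-CCAAGCTTGG-3' suitable for this step can be checked with a short sketch. The Python below is an illustration only: the helper name `reverse_complement` and the variable names are assumptions introduced here, not part of the procedure described. It confirms that the linker is self-complementary (so two copies anneal into a blunt-ended duplex) and that it carries the HindIII recognition sequence AAGCTT.

```python
# Illustrative check of the synthetic HindIII linker quoted in the text.
# Helper and variable names are hypothetical, chosen for this sketch.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


LINKER = "CCAAGCTTGG"    # linker sequence as given in the text
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence

# A self-complementary oligonucleotide equals its own reverse complement,
# so two copies can anneal to each other to form a blunt duplex.
is_self_complementary = reverse_complement(LINKER) == LINKER

# 0-based position of the HindIII site within the linker (-1 if absent).
site_position = LINKER.find(HINDIII_SITE)

print(is_self_complementary, site_position)
```

Because the linker is its own reverse complement, only one oligonucleotide needs to be synthesised to obtain the double-stranded linker.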
A HinfI restriction site is located between codons 7 and 8 of the lambda sequence. The 5'
end of the lambda gene was isolated as a 148bp HinfI to SacI fragment (1 in Fig. 2).

Two oligodeoxyribonucleotides were designed to restore codons 1-8, and to provide an
initiator ATG as well as BclI and HinfI sticky ends. The two chemically synthesised
oligonucleotides made to facilitate assembly of the gene had the sequences:

R45 5' -pGATCAATGCAGGCTGTTGTG 3'

R44 3' CCGACAACACTGAGTCCTTAp- 5'

pCT54 was cut with both BclI and HindIII and the resulting linear molecules isolated, mixed
together with the two oligodeoxyribonucleotide linkers R44 and R45 and both fragments 1 and
2, and ligated using T4 ligase (Fig. 2). The mixture was used to transform E. coli DH1 to
ampicillin resistance. Recombinant clones in pCT54 were identified by hybridisation of DNA
from replica plated colonies on nitrocellulose to a nick-translated probe derived from the
pATλ1-15 insert.

A clone was identified which hybridised to lambda cDNA and also showed the predicted
restriction fragment pattern. This plasmid (designated pCT54 19-1) was sequenced from the
ClaI site and shown to have the anticipated sequence except that there was a mutation of the
fourth codon from GTG to ATG, changing the amino acid at this point from valine to
methionine.

The sequence in this area was:

. . . GATTGATCA.ATG.CAG.GCT.GTT.ATG.ACT.CAG.GAA.TCT.GCA.CTC.ACC.ACA.TCA . . .
                met gln ala val met thr gln glu ser ala leu thr thr ser
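
The correspondence between the nucleotide sequence and the amino acid residues listed beneath it can be verified by translating the reading frame that starts at the initiator ATG. The following Python sketch is offered as an illustration only. It uses the standard genetic code, restricted to the codons that actually occur in the quoted sequence; the function name `translate` and the three-letter lower-case residue labels are choices made for this sketch.

```python
# Partial standard genetic code: only the codons present in the quoted sequence.
CODON_TABLE = {
    "ATG": "met", "CAG": "gln", "GCT": "ala", "GTT": "val",
    "ACT": "thr", "GAA": "glu", "TCT": "ser", "GCA": "ala",
    "CTC": "leu", "ACC": "thr", "ACA": "thr", "TCA": "ser",
}


def translate(coding: str) -> list[str]:
    """Split a coding sequence into codons and map each to its residue."""
    usable = len(coding) - len(coding) % 3  # ignore any trailing partial codon
    codons = [coding[i:i + 3] for i in range(0, usable, 3)]
    return [CODON_TABLE[codon] for codon in codons]


# Reading frame from the initiator ATG, dots removed, as quoted in the text.
SEQUENCE = "ATGCAGGCTGTTATGACTCAGGAATCTGCACTCACCACATCA"

residues = translate(SEQUENCE)
print(" ".join(residues))
```

The fifth triplet in this frame is ATG, which is the position of the valine-to-methionine change noted for this clone.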

The restriction enzyme sites in pCT54 between the Shine and Dalgarno sequence (AAGG),
which is important for ribosome binding, and the ATG allow for the adjustment of the SD-ATG
distance, an important parameter in determining expression rates. The SD-ATG distance was
reduced by cutting the plasmid with ClaI or BclI and creating blunt ended species by digestion
with S1 nuclease. 2 μg of ClaI cut DNA was digested with 200 units of S1 nuclease for 30
minutes at 30° using standard buffer conditions. The solution was deproteinised with phenol
and the DNA recovered by ethanol precipitation. This DNA on religation with T4 DNA ligase and
transformation into E. coli strain HB101 gave rise to a number of plasmids which had lost the
ClaI or BclI site. The plasmids which had lost their ClaI site were sequenced in the region
surrounding the initiator ATG.

. . . AAGGGTATTGATCAATGCAG . . .   plasmid pNP3
      SD            met gln

. . . AAGGGTTTGATCAATGCAG . . .    plasmid pNP4
      SD           met gln
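
The SD-ATG distances of these two plasmids can be read off the sequences directly. The Python sketch below is an illustration under stated assumptions: the distance is taken to be the number of nucleotides lying strictly between the last base of AAGG and the first base of the initiator ATG, the sequences are transcribed from the lines above with spacing removed, and the function name `sd_atg_spacing` is hypothetical.

```python
def sd_atg_spacing(seq: str, sd: str = "AAGG", start: str = "ATG") -> int:
    """Count the nucleotides between the end of the SD sequence and the
    first start codon found downstream of it (counting convention is an
    assumption of this sketch)."""
    seq = seq.upper().replace(" ", "").replace(".", "")
    sd_end = seq.index(sd) + len(sd)      # first position after AAGG
    atg_pos = seq.index(start, sd_end)    # first ATG downstream of the SD
    return atg_pos - sd_end


# Sequences as shown for the two plasmids that had lost the ClaI site.
PNP3 = "AAGGGTATTGATCAATGCAG"
PNP4 = "AAGGGTTTGATCAATGCAG"

spacing_pnp3 = sd_atg_spacing(PNP3)
spacing_pnp4 = sd_atg_spacing(PNP4)
print(spacing_pnp3, spacing_pnp4)
```

Under this convention pNP4 has one nucleotide fewer than pNP3 between the two elements, consistent with a slightly larger deletion at the ClaI site.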

In order to achieve high level expression a number of other approaches were followed. Firstly, a
series of constructs were obtained which had increasing amounts of the 3' untranslated region
of the cDNA removed by Bal 31 exonuclease. Secondly, a high copy number plasmid containing
λ cDNA was constructed. This plasmid contained a par function (Meacock, P. A. and Cohen,
S. N., Cell, 20, 529-542, 1980) as well as being present in high copy number. Thirdly, the
pNP3 plasmid was transformed into a number of protease-deficient strains or into HB101 in
conjunction with a protease deficient dominant acting plasmid (Grossman, A. D. et al., Cell, 32,
151-159, 1983).

Construction of μ Heavy Chain Expression Plasmid

The full-length μ heavy chain cDNA derived from the NP binding monoclonal antibody B1-8
had been cloned into the PstI site of pBR322 yielding a plasmid designated pABμ-11 (Bothwell
et al., Cell, 24, 625-637, 1981). In order to achieve high level expression, the μ cDNA minus
the eukaryotic leader was reconstructed into pCT54. The construction of the μ heavy chain
expression plasmid is shown diagrammatically in Fig. 3. Two chemically synthesised
oligonucleotides were made to facilitate this. These have the following sequences:

R43 5' GATCAATGCAGGTTCAGCTGCA 3'
R46 3'     TTACGTCCAAGTCG     5'

These were ligated into BclI cut pCT54 using T4 DNA ligase and the resulting plasmid
designated pCT54 Pst. The linkers were designed to replace the sequence in pCT54 between
the BclI site and the ATG, to provide an internal PstI site and to recreate the sequence from
pABμ-11 5' to the PstI site up to codon +1. pCT54 Pst was cut briefly with PstI, treated with
alkaline phosphatase and full length DNA isolated from a 1% agarose gel using glass powder.
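
How the oligonucleotides R43 and R46 pair, and which single-stranded ends they leave, can be worked out from the sequences given. The Python sketch below is an illustration only; the variable and function names are assumptions. It locates the stretch of R43 that is complementary to R46 (written 3' to 5', as in the text) and reports the unpaired ends of R43.

```python
# Illustrative annealing check for the linker oligonucleotides R43 and R46.
# Names are hypothetical; the sequences are those quoted in the text.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq: str) -> str:
    """Base-by-base complement, preserving the written order."""
    return "".join(COMPLEMENT[base] for base in seq)


R43 = "GATCAATGCAGGTTCAGCTGCA"  # top strand, written 5'->3'
R46_3_TO_5 = "TTACGTCCAAGTCG"   # bottom strand, written 3'->5'

# Because R46 is written 3'->5', it pairs with R43 wherever it matches the
# plain (non-reversed) complement of R43.
offset = complement(R43).find(R46_3_TO_5)

five_prime_overhang = R43[:offset]                       # unpaired 5' end of R43
three_prime_overhang = R43[offset + len(R46_3_TO_5):]    # unpaired 3' end of R43

print(offset, five_prime_overhang, three_prime_overhang)
```

The 5' GATC extension is the cohesive end produced by BclI (T^GATCA), and the 3' TGCA extension is the cohesive end produced by PstI (CTGCA^G), which fits the stated purpose of joining the linker to BclI cut vector while providing a PstI site.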

Similarly pABμ-11 was briefly cut with PstI and the full length μ insert isolated following
agarose gel electrophoresis. The μ cDNA was ligated with the full length pCT54 Pst fragment
using T4 DNA ligase under standard conditions and a plasmid designated pCT54μ was
identified which by restriction enzyme analysis was shown to contain a full-length μ insert. The
plasmid was sequenced around the 5' linker region and was found to have the anticipated
sequence:

5' . . . TGATCAATGCAGGTTCAGCTGCAGGGGGGGGATGGGATGGAG . . . 3',

demonstrating that it was, indeed, a full length clone. A complete PstI digest of pCT54μ liberated
a 1.4 Kb fragment which was purified by 0.8% agarose gel electrophoresis and glass powder isolation.
This was ligated with PstI cut pCT54 Pst (see above) using T4 DNA ligase under standard
conditions, followed by transformation into HB101. A plasmid designated pNP1 was isolated
which was shown by restriction endonuclease pattern analysis to contain the 1.4 Kb μ cDNA
fragment in an appropriate orientation (Fig. 3). pNP1 contained an appropriate 5' end for
expression, whilst pCT54μ contained an appropriate 3' end. The full
length gene was reconstructed into pCT54 by cutting both pNP1 and pCT54μ with AvaI, which
cuts once in the plasmid and once in the μ gene. Both digests were run on a 1% agarose gel and
the 1.9 Kb fragment from pNP1 and the 3.65 Kb fragment from pCT54μ isolated. Following
alkaline phosphatase treatment of the 3.65 Kb pCT54μ fragment, the two pieces of DNA were
ligated to each other and transformed into HB101. A plasmid designated pNP2 was identified
which demonstrated the correct restriction endonuclease pattern. It was sequenced in the area
surrounding the initiator ATG and found to have the anticipated sequence:

5' . . . TTGATCAATGCAGGTTCAGCTGCAGCAGCCTGGGGCTGAGCTTGTGAAG . . . 3'

The vector pCT54 had been constructed to include two restriction sites (BclI and ClaI)
between the S-D sequence (AAGG) and the initiation codon so that the distance between these
sequence elements could be varied. As most E. coli mRNAs have 6-11 nucleotides between the
S-D sequence and the AUG, the distance in pNP2 was reduced by modification at the ClaI site.
pNP2 was cut with ClaI and incubated with S1 nuclease. The amount of S1 nuclease was adjusted so
that some DNA molecules would lose 1-2 extra base pairs as a result of 'nibbling' by the
enzyme. This DNA on religation with T4 DNA ligase and transformation into E. coli strain HB101
gave rise to a number of colonies harbouring plasmids which had lost the ClaI site. The
sequences around the initiation codon of four plasmids pNP223, pNP261, pNP9 and pNP282
The expression of these constructs in E. coli B was examined in three experiments, and
concentrations of μ protein were determined by ELISA as relative units (ru), a measure of μ
concentration normalised to pNP9. The results are shown in Table 1.

Inductions were carried out by resuspending cells at 1:50 dilution from overnight cultures into
induction medium, consisting of: KH2PO4 (3g/L), Na2HPO4 (6g/L), NaCl (0.5g/L), Difco vitamin
assay casamino acids (30g/L), NH4Cl (4g/L), glycerol (16g/L), proline (0.5g/L), Difco yeast
extract (1g/L), CaCl2.6H2O (0.022g/L), MgSO4.7H2O (0.025g/L), thiamine (0.01g/L) and
carbenicillin (0.1g/L). Cultures were shaken at 37°C and cells harvested by centrifugation.

ELISA assays were performed essentially as described by Patel et al. (Nucleic Acid Research, 
10 10, 5605 to 5620, 1982), using affinity purified goat anti-mouse IgM (Tago) and affinity 10 
purified goat anti-mouse IgM (Tago) conjugated toperoxidase, with 3, 3', 5, 5' tetramethylbenzi- 
dine (Miles) as a substrate. Bacterial cell pellets were boiled for 2 minutes in 9M urea (BRL) and 
loaded directly into the ELISA, such that all wells contained 2.25M urea, 10mM Tris-HCI 
pH7.8, 150mM NaCI, 0.05% NP40 and 0.5% casein (Hammarsten, B.D.H.). 

Altering the S-D to ATG distance with nuclease S1 was found to increase the level of μ
expression, relative to the parental plasmid pNP2, but only to a small extent (compare pNP9).
The optimal S-D to AUG distance was found to be 9-10 nucleotides, as found in pNP9 and
pNP261 respectively.

Secondary structure Analysis
Hairpin loops were identified by use of the computer programme HAIRGU developed by Dr.
Roger Staden. The ΔG values were calculated as described by Tinoco et al. (Nature New Biology, 246,
40-41, 1973).
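
The kind of search involved can be sketched in Python as below. This is a naive illustration only: it is not the HAIRGU algorithm, it computes no free energies, and the function name, parameters and default thresholds are assumptions. It simply reports perfect stems (Watson-Crick and G-U pairs) closing a loop of bounded size.

```python
# Illustrative only: a naive perfect-stem search, not the HAIRGU programme.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def find_hairpins(rna, min_stem=4, min_loop=3, max_loop=8):
    """Return (start, end, stem_length) for each perfect stem found.

    start and end are 0-based indices of the outermost paired bases.
    """
    hits = []
    n = len(rna)
    for loop in range(min_loop, max_loop + 1):
        for left in range(n - loop - 1):
            # left is the innermost 5' stem base; its partner lies after the loop.
            i, j = left, left + loop + 1
            stem = 0
            # Extend the stem outwards while the bases can pair.
            while i >= 0 and j < n and (rna[i], rna[j]) in PAIRS:
                stem += 1
                i -= 1
                j += 1
            if stem >= min_stem:
                hits.append((i + 1, j - 1, stem))
    return hits


print(find_hairpins("GGGGAAACCCC"))
```

A fuller treatment would score each candidate stem with nearest-neighbour energy rules of the kind cited above, which this sketch deliberately omits.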

Analysis of the potential secondary structure of μ mRNA encoded by pNP2 and pNP9 (one of
the S1 derivatives of pNP2) revealed an extensive array of possible hairpin loops. Attention was
focused on the region of this mRNA containing the ribosome binding site since the sequestering
of S-D sequences and initiation codons into secondary structure may inhibit the initiation of
translation. It was found that a hairpin loop including the U and G of the initiation codon could
be formed (see Fig. 4). This hairpin loop is formed entirely within coding sequences and has a
ΔG = -7.6 kcal. No secondary structures were found which buried the S-D sequence, and
none which mutually excluded the hairpin loop described above, and which had a lower ΔG.

To test whether expression of μ protein could be influenced by changes designed to alter
potential secondary structure in the ribosome binding region we used synthetic oligonucleotides
to mutagenise this region of μ mRNA. The plasmids were constructed by insertion of a pair of
oligonucleotides between the blunted Cla I site and the 5' Pst site of μ in pNP2 (see Fig. 5). The
cloning strategy used to accomplish this, however, was multi-staged due to the presence of four
Pst sites in pNP2. The vector pACYC184CM, which lacks Cla I and Pst I sites, was first
constructed by cutting pACYC184 at its unique Cla I site, 'filling-in' the cohesive termini using
T4 DNA polymerase and religating. Next, pACYC184CM was cut with BamHI, and the BamHI
fragment from pNP2, bearing the 5' μ sequence, was inserted into it to form pCMμ (see Fig. 4).
The vector pCMμ contained unique Cla I and Pst sites which were required for the oligonucleotide
cloning (see Fig. 5).

pCMμ was cut with Cla I to completion, blunted with nuclease S1, and then cut with Pst I.
The cut plasmid was purified by agarose gel electrophoresis and ligated separately with pairs of
oligonucleotides, with the latter in a 100-fold molar excess. The pairs of oligonucleotides used
were: R131 and R132 (for pNP11); R196 and R197 (for pNP12); and R202 and R203 (for
pNP14). These have the sequences given below.

TGCACATATGCAAGTGCAACTGCA (R131), GTTGCACTTGCATATGTGCA (R132),
GCTGAACCTGCATATGTGCA (R196), TGCACATATGCAGGTTCAGCTGCA (R197),
GTTGCACTTGCATTGATC (R202), GATCAATGCAAGTGCAACTGCA (R203).
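
The pairing of these oligonucleotides can be checked with a short Python sketch. The helper names are illustrative assumptions; the sequences are those listed above. For each pair, the shorter oligonucleotide is reverse-complemented and compared with the start of the longer one, and whatever remains unpaired at the 3' end of the longer strand is reported.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


# (longer oligonucleotide, shorter oligonucleotide) for each construct.
OLIGO_PAIRS = {
    "pNP11": ("TGCACATATGCAAGTGCAACTGCA", "GTTGCACTTGCATATGTGCA"),  # R131, R132
    "pNP12": ("TGCACATATGCAGGTTCAGCTGCA", "GCTGAACCTGCATATGTGCA"),  # R197, R196
    "pNP14": ("GATCAATGCAAGTGCAACTGCA", "GTTGCACTTGCATTGATC"),      # R203, R202
}


def single_stranded_tail(long_oligo, short_oligo):
    """Return the unpaired 3' tail of long_oligo when annealed to short_oligo.

    Returns None if the short oligonucleotide does not pair with the 5' end
    of the long one.
    """
    duplex = reverse_complement(short_oligo)
    if long_oligo.startswith(duplex):
        return long_oligo[len(duplex):]
    return None


for plasmid, (long_oligo, short_oligo) in OLIGO_PAIRS.items():
    print(plasmid, single_stranded_tail(long_oligo, short_oligo))
```

Each pair anneals to give one blunt end and a four-base 3' extension, which is consistent with ligation between a blunted Cla I end and a Pst I end as described.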

The ligation mixture was then used to transform HB101. 

Recombinant clones were detected by colony hybridisation, on nitrocellulose filters, using one
of each pair of oligonucleotides as a probe. Positive clones were sequenced and shown to have
the correct nucleotide sequence. The μ sequences containing the cloned oligonucleotides were
excised on Eco RV-Bgl II fragments. These were ligated to Eco RV-Bgl II cut pNP2 to reconstruct
the full length μ genes (see Fig. 5). Three different plasmids were created by these procedures:
pNP11, pNP12 and pNP14, having the nucleotide sequences shown in Table 1 above.

In the construct pNP11 the potentially deleterious hairpin loop of pNP2 was abolished by
changing the residues in the degenerate position of three codons and by introducing a different
S-D to AUG sequence. These changes allow the region between the S-D and AUG to base pair
with the 5' end of μ coding sequence to form a hairpin loop of ΔG = -7.8 kcal, approximately
equal to that of pNP2, but leaving the AUG and S-D sequences exposed (see Fig. 4). The only
other mutually exclusive hairpin loops are of ΔG = -2.3, -3.6 and +0.4 kcal.

The construct pNP12 retains the S-D to AUG sequence of pNP11, but has the coding
sequence of pNP2, such that the hairpin loop of pNP2 can form (see Fig. 4).

The final μ construct was pNP14, which had the S-D to AUG sequence of pNP9, but the 5'
coding sequence of pNP11, so that the hairpin loop of pNP12 could not form. The construct
pNP14 had no secondary structure burying the S-D or μ AUG.

μ Protein Expression from Constructs with Differing mRNA Secondary Structures

E. coli B strains containing plasmids pNP9, pNP11, pNP12 and pNP14 were grown in
induction medium and samples removed for ELISA assay. The concentration of μ was increased
6-7 fold in E. coli B cells containing pNP11 relative to pNP9 (see Table 1, Part b). Plasmid
pNP12, which has the S-D to AUG sequence of pNP11, but the coding sequence of pNP9,
showed only a two-fold increase in level of μ expression compared to pNP9.

Plasmid pNP14, which has the S-D to AUG sequence of pNP9 but the 5' μ coding sequence
of pNP11, and thus differed from pNP9 only in three residues, was found to express μ protein
at a level over 90 times that found in pNP9.

A plasmid, pNP8, has also been constructed, from which μ is expressed, also from the trp
promoter, as part of a trpE-μ fusion protein containing the amino terminal 53 amino acids of
trpE and the carboxyl terminal 503 amino acids of μ. The rationale for making this fusion gene
was that μ would be translated from an efficient bacterial ribosome binding site rather than from
one of unknown efficiency. Then, by comparing the amounts of trpE-μ produced from pNP8
with those of μ produced from the different constructs, an estimate of relative ribosome binding
efficiency may be made. When the trpE-μ gene in pNP8 was induced and the products
quantitated by ELISA, it was found that the levels of μ expression in strains containing pNP8 and pNP14
were very similar (see Table 1, part B).

The ribosome binding site (RBS) sequences of pNP11,
pNP12 and pNP14 were designed in order to test the hypothesis that sequestering of the
initiation codon into secondary structure inhibits translation initiation, and to increase the rate of
initiation by releasing the initiation codon. In pNP11, the RBS hairpin loop of pNP2 and pNP9
was abolished by changing the 5' μ coding sequence in the degenerate position of three codons.
In addition, the S-D to AUG sequence was changed to allow the base-pairing of this region with
the 5' coding sequence. The result of these changes is that the initiation codon is exposed in
the loop of a hairpin with the S-D sequence at the base of the stem (see Fig. 4). This hairpin
loop had a ΔG approximately equivalent to the ΔG of the pNP9 hairpin loop. Strains containing
pNP11 produced 6-7 times as much μ as did those with pNP9. To test whether this was due to
the change in S-D to ATG sequence characteristics, rather than to the liberation of the initiation
codon, pNP12 was constructed as a control. The latter has the S-D to AUG sequence of pNP11,
but retains the 5' μ coding sequence of pNP9, so that the RBS hairpin loop of pNP11 cannot
form, but the potentially deleterious one of pNP9 can form. The level of μ expression from
pNP12 was found to be increased by only two-fold relative to pNP9. This increase is probably
due to an altered S-D to AUG distance, a U immediately 5' to the initiation codon, and the S-D
to AUG sequence being G-poor, each of which has been shown to be advantageous. It is
therefore likely that the increased expression from pNP11 is due to a preferable mRNA
secondary structure. The RBS hairpin loop of pNP9 probably acts to inhibit translation while
creation of another hairpin loop which exposes the initiation codon is responsible for the
increased expression.

The plasmid pNP14 was constructed, such that it had the S-D to AUG sequence of pNP9, but
the coding sequence of pNP11. The expression plasmid pNP14, therefore, differed from pNP9
in only three residues, those in the degenerate position of three codons, yet strains containing
pNP14 expressed over ninety times more μ than those containing pNP9. This increase in
expression cannot be explained on the basis of the three residue changes optimizing codon
usage, for they introduce less favoured codons, as judged by the frequency of their occurrence
in strongly expressed E. coli genes.

The increased expression is best explained by the abolition of the RBS hairpin loop in pNP9.
The magnitude of the increases in expression observed, and the few residue changes
introduced, make it difficult for these results to be interpreted in terms of changes in mRNA
stability.

The expression of pNP8, which encodes the trpE-μ fusion protein from a native trpE RBS, was
also examined, and found to express at approximately the same level as pNP14. This may
indicate that the RBS of pNP14 is equivalent to that of trpE in terms of its efficiency in directing
translation initiation. The native trpE RBS of pNP8 and that of pNP14 were found to have no
secondary structures hindering the use of their RBS sequences.

Expression of μ Protein in E. coli

E. coli B cells containing the μ expression plasmid pNP11 were grown under inducing
conditions and soluble and insoluble extracts prepared, and analysed by SDS-PAGE. A novel
band was seen after staining the gel with Coomassie blue in the lane containing proteins from
the insoluble fraction (see Fig. 6, lane 2). This band was not seen in the negative control lane
which contained proteins from the same fraction from cells harbouring pCT70 (see Fig. 6, lane
3). The novel band was found to migrate to a position corresponding to a protein of a molecular
weight within less than 5% of the actual molecular weight of non-glycosylated μ of 62.5Kd. A
duplicate set of lanes was transferred to nitrocellulose, and Western blotted. Alignment of the
stained gel and the blot autoradiogram confirms that this novel band is antigenically related to
IgM (see Fig. 6, lanes 4 and 8). No band was found in extracts from cells containing pCT70
(see Fig. 6, lanes 5 and 7). Only a low amount of μ was found in the soluble fraction (see Fig.
6).

A greatly increased level of expression of μ was found in E. coli B compared to HB101. Pulse
chase analysis demonstrated that in E. coli B, a similar level of μ protein was detected after a
60 minute chase (Fig. 7, lane 3) as was seen after the initial labelling period (Fig. 7, lane 1). In
HB101, however, very little μ protein could be seen after a 10 minute chase (Fig. 7, lane 7),
and none after 30 minutes (Fig. 7, lane 8), compared to the amount detected after the initial
labelling period (Fig. 7, lane 5).

For pulse chase analysis, inductions were set up as described above, except that the medium
used consisted of: proline (0.3g/L), leucine (0.1g/L), Difco methionine assay medium (5g/L),
glucose (60mg/L), thiamine (10mg/L), CaCl2 (22mg/L), MgSO4 (0.25g/L) and carbenicillin
(0.1g/L). During exponential growth cells were pulse labelled with 30 μCi/ml L-[35S]methionine
for 2 minutes, after which unlabelled methionine (100 μg/ml) was added and the incubation
continued for the times indicated.

Induced E. coli B cells harbouring pNP14 when examined by phase contrast microscopy were
found to contain inclusion bodies.

Expression of λ Light Chain in E. coli

A fresh 1 ml overnight culture of E. coli HB101 or RV308 containing the plasmid under study
was grown in L Broth (Maniatis et al., Molecular Cloning, Cold Spring Harbor Laboratory,
1982), supplemented with carbenicillin to 100 μg/ml.
A 1:100 dilution of this culture was made into M9 medium (Maniatis, 1982, op. cit.)
supplemented with glucose, vitamin B1, carbenicillin, leucine and proline and the cultures
shaken at 37° for 5-6 hours. At this time 35S-methionine was added for 5-20 minutes. The
cells were then harvested by centrifugation, lysed by boiling in 1% SDS for 2 minutes and
diluted by addition of a buffer consisting of 2% Triton X-100, 50 mM Tris pH 8, 0.15 M NaCl
and 0.1 mM EDTA. Immunoprecipitations were carried out by addition of antisera to aliquots of
labelled E. coli extracts and, after incubation at 4° overnight, immune complexes were isolated
by binding to Staphylococcus aureus fixed cells. The complexes were dissociated by boiling in
60 mM Tris pH 6.8 buffer containing 10% glycerol, 3% SDS and 5% mercaptoethanol and the
liberated proteins analysed on 10% or 12.5% acrylamide/SDS gels (Laemmli, U.K., Nature,
227, 680-685, 1970). Gels were stained with Coomassie Brilliant blue to visualise the protein
bands while labelled proteins were detected on Fuji-RX film by fluorography using 1 M sodium
salicylate.

HB101 cells containing pNP3 and other plasmids were grown under inducing conditions to
an OD600 = 0.6 and the proteins present examined by specific immunoprecipitation substantially
as described above. The results obtained are given in Fig. 8, in which lanes 1 and 2 respectively
were extracts from pCT54-19 and pNP3, immunoprecipitated with normal rabbit serum; lanes
3, 4 and 5 were extracts immunoprecipitated with rabbit anti-lambda serum and represent pNP4
(lane 3), pNP3 (lane 4) and pCT54-19 (lane 5). The position of unlabelled lambda protein from
MOPC104E is indicated on the left hand side of Fig. 8. All of the plasmids produced a protein
which reacted with rabbit anti-mouse λ light chain serum and comigrated with authentic λ1 light
chain from the mouse myeloma MOPC104E. pNP3 however produced the most of this protein
(compare lane 4 with lanes 3 and 5 in Fig. 8). No such band was detected with control
immunoprecipitations using normal rabbit serum (Fig. 8, lanes 1 and 2). Studies using brief
pulses with 35S-methionine followed by chasing with excess cold methionine indicated that the
recombinant λ light chain had a half-life of about 20 minutes.
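
To give a feel for what a 20 minute half-life implies during a chase, the Python sketch below computes the fraction of labelled protein expected to remain at a given time. It assumes simple first-order decay, which the text does not state; the function name is illustrative.

```python
def fraction_remaining(minutes, half_life_minutes=20.0):
    """Fraction of labelled protein left after `minutes`, assuming first-order decay."""
    if minutes < 0 or half_life_minutes <= 0:
        raise ValueError("time must be non-negative and half-life positive")
    return 0.5 ** (minutes / half_life_minutes)


# Expected fraction remaining at several chase times.
for t in (0, 10, 20, 40, 60):
    print(t, round(fraction_remaining(t), 3))
```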

In vitro transcription/translation (Pratt et al., Nucleic Acids Research, 9, 4459-447, 1981)
confirmed that pNP3 coded for a protein which comigrated with authentic λ light chain. This 25
Kd product was synthesised in vitro at a rate comparable to that of β-lactamase indicating that it
is synthesised in vivo at a level of 0.5% of total E. coli proteins. This figure is in good
agreement with the percentage of recombinant λ product synthesised in vivo (0.4-2%) as
determined from the following equation:

$$\%\ \text{specific CPM} = 100 \times \frac{(\text{CPM with anti-}\lambda) - (\text{CPM with normal serum})}{\text{total CPM}}$$
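
The calculation can be written as a small Python function. The function and argument names are illustrative, and the counts used in the example are hypothetical values chosen only to show the arithmetic, not measurements from this work.

```python
def percent_specific_cpm(cpm_anti_lambda, cpm_normal_serum, total_cpm):
    """Percentage of total incorporated counts specifically precipitated by anti-lambda serum."""
    if total_cpm <= 0:
        raise ValueError("total CPM must be positive")
    return 100.0 * (cpm_anti_lambda - cpm_normal_serum) / total_cpm


# Hypothetical counts, for illustration only.
print(percent_specific_cpm(12_000, 2_000, 1_000_000))
```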

The procedure used to disrupt cells was as follows (Emtage J S et al, Proc. Natl. Acad. Sci., 80,
3671-3715, 1983). E. coli HB101/pNP3 grown under inducing conditions were harvested and
resuspended in 0.05 M Tris pH 8, 0.233 M NaCl, 5% glycerol containing 130 μg/ml of
lysozyme and incubated at 4°C or room temperature for 20 minutes. Sodium deoxycholate was
then added to a final concentration of 0.05% and 10 μg of DNAase I (from bovine pancreas)
was added per g wet wt of E. coli. The solution was incubated at 15°C for 30 minutes by
which time the viscosity of the solution had decreased markedly. The extract was centrifuged (at
10,000 x g for 15 minutes for small volumes (1 ml) or 1 hour for larger volumes) to produce a
soluble and an insoluble fraction.

For immunoprecipitations, the soluble fraction was diluted in the Triton-containing buffer
described above and the insoluble fraction solubilised by boiling in 1% SDS followed by dilution
in Triton-containing buffer. HB101 cells containing pNP3 or pNP4 were grown under inducing
conditions, pulse labelled with [35S]methionine, separated into soluble and insoluble fractions
and analysed by sodium dodecyl sulfate polyacrylamide gel electrophoresis. The results obtained
are shown in Fig. 9 in which lanes 1 and 3 correspond to the soluble fraction from HB101-pNP3
immunoprecipitated with normal rabbit serum and rabbit anti-lambda serum respectively;
lanes 2 and 4 correspond to the insoluble fraction from HB101-pNP3 immunoprecipitated with
normal rabbit serum and rabbit anti-lambda serum respectively; lanes 5 and 7 correspond to the
soluble fraction from HB101-pNP4 immunoprecipitated with normal rabbit serum and rabbit
anti-lambda serum respectively; lane 9 corresponds to the soluble fraction from HB101-pNP3,
and lane 10 corresponds to the insoluble fraction from HB101-pNP3. The position of unlabelled
lambda protein from MOPC104E is indicated on the left hand side of Fig. 9. When such a
procedure was carried out using pNP3, the recombinant λ light chain protein was present in the
insoluble fraction (Fig. 9, lane 4) rather than the soluble fraction (Fig. 9, lane 3). No 25 Kd
band comigrating with authentic λ1 light chain was present when either the soluble or insoluble
fraction was immunoprecipitated using normal rabbit serum (Fig. 9, lanes 1 and 2). By
inference it is likely that all Ig light chains will fractionate as in this specific example.

In the absence of specific immunoprecipitation a novel protein band was not visible from
extracts of HB101 containing pNP3 nor was the λ protein found to accumulate. However, there
was a dramatic difference when pNP3 was induced in the K12 strain E103s. In this strain λ
protein was found to accumulate during induction until the cells reached stationary phase (Fig.
10, lanes 3-5) to a level of about 150 times that found in HB101 as determined by an ELISA
(enzyme linked immunosorbent assay). These cells were found to contain inclusion bodies which
appeared refractile under light microscopy, a phenomenon characteristic of high level expression
of foreign proteins. An estimate of the percentage of total E. coli protein represented by
recombinant λ protein was obtained by separating the proteins by gel electrophoresis, staining
them with Coomassie blue and scanning the stained gel with a Joyce-Loebl chromoscan 3. This
method showed that λ was the major protein present (Fig. 10, lane 5) and represented 13% of
total E. coli protein. The λ protein had a half-life of 20 minutes in HB101 but accumulated to
very high levels in E103s, suggesting that λ protein was much more stable in the latter
strain. After cell lysis and centrifugation of HB101 or E103s containing pNP3, λ light chain was
detected in the insoluble (Fig. 10, lanes 7 and 10) but not in the soluble fractions (Fig. 10,
lane 6), as determined by Coomassie blue staining. The identity of the major Coomassie blue
stained band as λ protein was confirmed by Western blot analysis (Fig. 10, lanes 8-10). The
presence of such immunoreactive bands was specific to pNP3-containing cells. When extracts
from cells containing pCT70, a prochymosin expressing plasmid, were subjected to the same
analysis no bands were detected. This more sensitive technique showed that a small amount of
the λ protein was in the soluble fraction (Fig. 10, lane 9). A number of distinct
immunoreactive proteins, all smaller than the full-length protein, were also detected. These may result
from proteolytic degradation of λ protein, from premature termination of transcription or from
internal initiation of translation.

Expression of μ and Lambda Polypeptides in the same Bacterial Cell
The Ig μ and λ genes in expression plasmids were each transformed into the same E. coli
cell to direct the synthesis of both Ig μ and λ polypeptides. In order to overcome plasmid
incompatibility and provide a second antibiotic resistance marker, the trp promoter and λ
sequences were excised from pNP3 on a Hind III-Bam HI fragment and inserted into the Hind
III-Bam HI fragment of pACYC 184. The resultant plasmid pACYC λ caused the E. coli to grow
very poorly. This weak growth was thought to result from read-through of RNA polymerase into
the origin of replication. However, the inhibition of growth was virtually eliminated by the
cloning in of the bacteriophage T7 transcriptional terminator at the Hind III site of plasmid
pACYC λ.

This terminator functions in both orientations. The resultant plasmid pAC λ T7-1 has a
chloramphenicol resistance gene and an origin compatible with the pBR 322-derived origin on
pNP14, the Ig μ expressing plasmid. Transformation of both plasmids in the same E. coli B was
achieved in two steps. Firstly pNP14 was introduced followed by pAC λ T7-1 in two sequential
transformations to give ampicillin and chloramphenicol resistant clones.

E. coli B cells derived from this double-transformant clone showed the presence of inclusion
bodies and two novel polypeptide bands on stained gels of the insoluble fraction after lysis.
These two bands correspond to Ig μ and Ig λ both in immunological activity by Western blotting
and in their expected molecular weights. It has thus been shown that the double-transformant
clone expresses both the heterologous genes. This had not hitherto been shown.

The presence of λ light chain in the insoluble fraction was a useful purification step since it
both concentrated the protein and separated it from the bulk of E. coli soluble proteins.

For further purification of the λ light chain, the cell debris was dissolved in 10mM Tris-HCl
pH8.0, 25% formamide, 7M urea, 1mM EDTA and 2mM dithiothreitol. This material was
loaded onto a DEAE Sephacel column (Pharmacia) (1 x 25cm), at a flow rate of 5ml/hr, which
had been equilibrated in 9M urea, 10mM Tris-HCl pH8.0, 1mM EDTA and 2mM DTT. The
DEAE Sephacel column was developed using a 0-150mM NaCl gradient in loading buffer. The
eluted peak of λ light chain immunoreactivity, corresponding to the major peak of protein, was
diluted to a final concentration of 2.25M urea, 10mM Tris-HCl pH8.0, 1mM EDTA, 2mM DTT
and loaded onto an octyl-Sepharose column (Pharmacia) (2.5 x 10cm). Material was eluted by
use of a urea gradient of 2.25-9M urea. The peak material was pooled, dialysed into
ammonium bicarbonate and lyophilised. Following this step, only a single band of Coomassie
blue stainable material corresponding to recombinant λ protein was visualised by SDS-PAGE.
The μ heavy chain was purified from 9M urea solubilised pellets by anion exchange
chromatography and chromatofocussing (Pharmacia).

It was of great interest to determine whether the concomitant expression of μ and λ would
lead to the formation of functional IgM. In order to determine this, extracts were made from E.
coli containing both Ig μ and λ polypeptides and these were tested for antigen binding. We used a
two-site sandwich ELISA which detects μ chain binding to haptenated bovine serum albumin
(NIP-caproate-BSA).

In the NIP binding assay, bovine serum albumin (BSA) is reacted with an equimolar amount
of NIP-Cap-N-hydroxysuccinimide at pH 7.5 in 10mM phosphate buffer. The resulting
NIP-Cap-BSA is separated from free NIP-Cap on a G-50 Sephadex column.

Microtitre plates (96 well Nunc Immuno Plate 1) are coated with 100 μl of a solution of 10
μg/ml NIP-BSA in sodium carbonate/sodium bicarbonate 0.1 M, pH 9.6 buffer (coating buffer)
overnight at 4°C.

The coated plates are then blocked for non-specific binding by addition of 100 μl of 0.5%
casein in coating buffer (blocking buffer) and incubating at 37°C for 1 hour.

The coated and blocked plates are then washed three times in 150 mM NaCl, 0.05% NP40,
20mM Tris-HCl, pH 7.8 buffer (washing buffer).

The washed plates are shaken free of excess washing buffer and samples are added in their
buffer or typically in 0.5% casein in washing buffer (sample buffer) to a 100 μl final volume.
In order to demonstrate the nature of the binding to NIP-BSA, samples for testing were
supplemented with either NIP or NP in free solution at various concentrations (from 60 to 0.3
μM for NIP or from 600 to 3 μM for NP).

The plates with samples are then incubated at 37°C for one hour and thereafter washed three
times with washing buffer.
The washed plates are then inoculated with 100 μl of anti-μ-peroxidase conjugate (1:1000
dilution; TAGO Inc) in sample buffer, and incubated for one hour at 37°C. The plates are then
washed three times in washing buffer.

The washed plates are inoculated with 100 μl of 0.1 M Na acetate-citrate, pH 6.0, 0.1
mg/ml tetramethylbenzidine, 13mM H2O2 (peroxidase substrate) and incubated at room
temperature for one hour. The reaction is terminated by the addition of 25 μl of 2.5M H2SO4.

The microtitre plates are then read in a plate reader (Dynatech) at 450 nm with a reference of
630 nm. The A450 was related to the level of a standard protein, B1-8 anti-NIP IgM.

This assay demonstrates sensitivity to 60pg of B1-8 IgM. The extracts were prepared as
soluble and insoluble material. The insoluble material was solubilized in the same buffer used in
lysis but containing 8M urea followed by its dilution for assay.

In order to obtain activity for the Ig μ and λ, extracts were made of the insoluble fraction and
these dialysed into buffer conditions in which disulphide interchange will occur at a higher
frequency.

Production of functional antibodies from E. coli expressing both heavy and light chains was
achieved by lysing the cells and clarifying the supernatant by centrifugation. The insoluble
material was washed, followed by sonication (3 times for 3 minutes), and finally dissolved in 9M
urea, 50mM glycine-NaOH pH10.8, 1mM EDTA, and 20mM 2-mercaptoethanol. This extract
was dialysed for 40 hours against 3 changes of 20 vols. of 100mM KCl, 50mM glycine-NaOH
pH10.8, 5% glycerol, 0.05mM EDTA, 0.5mM reduced glutathione and 0.1mM oxidised
glutathione. The dialysate was cleared by centrifugation at 30,000g for 15 minutes and loaded
directly onto DEAE Sephacel, followed by development with a 0-0.5M KCl linear gradient in
10mM Tris-HCl, 0.5mM EDTA, pH8.0.

The purified Ig μ and λ were treated as above, except that no anion exchange chromatography
was carried out. The preparation was finally dialysed into phosphate buffered saline, 5%
glycerol, 0.01% sodium azide and 0.5mM EDTA pH7.4.

The results from assays of material processed in this way indicated that some activity was
obtained. The level of activity obtained in this way was too low to allow any detailed studies, so
the resultant dialysate was purified by anion exchange chromatography (Fig. 11). This process
resulted in the isolation of significant NIP-cap-BSA binding activity over that of background
binding to BSA (Fig. 11). The assay of the fractions for the level of Ig μ, expressed as B1-8
IgM equivalents, demonstrated two peaks of activity. This was not found to correlate with full
length Ig μ by Western blotting. The first peak observed may represent a fragment of Ig μ. The
separation of NIP-cap-BSA binding activity from the majority of full length Ig μ and λ protein
indicates that the hapten binding activity is contained within a particular molecular species
formed at low efficiency.

The processing of insoluble material obtained from Ig μ expression in E. coli produced a
similar IgM protein profile but without NIP-cap-BSA binding activity. This demonstrates that the
activity recovered was a property of the combined immunoglobulin expression, not of some E.
coli factor, or of the Ig μ heavy chain alone.

Further studies of the characteristics of the hapten (NIP-cap-BSA) binding were carried out. As
samples were diluted they showed less binding to hapten in a very similar way to the original
antibody (Fig. 12). Free hapten was found to inhibit most of the binding activity in both
undiluted and diluted samples. Using B1-8 antibody as a standard for both IgM and hapten
binding, the specific activity of the assembled antibody was calculated to be 1.4 X 10" gm/gm
of IgM equivalents. This value demonstrates the inefficient recovery of activity, but possibly
represents an underestimate of the specific activity due to an overestimate of full-length Ig μ in
these fractions, as described above.



Heteroclitic Nature of Recombined Antibody
Detailed specificity of binding to NIP-cap-BSA was investigated by comparing the assembled
antibodies with B1-8 IgM in the presence of free NIP-cap and NP-cap (Fig. 13). Both B1-8
IgM and the assembled antibodies showed that higher NP-cap than NIP-cap concentrations were
required to inhibit NIP-cap-BSA binding.
The heteroclitic nature is demonstrated by the molar ratio of NIP to NP at 50% inhibition. The
concentrations of NIP and NP at 50% inhibition (I50) were found to be similar for both B1-8
and the assembled antibodies as shown in Table 2. Also the NP I50/NIP I50 ratios were similar
(Table 2).

Table 2. Hapten concentration at 50% inhibition (I50) of binding of
antibodies to NIP-cap-BSA solid phase.
SD = Standard Deviation.

                      NIP I50 (μM)       NP I50 (μM)       NP I50/NIP I50

B1-8 IgM              0.13 (SD, 0.05)    3.7 (SD, 2.9)     29
Fraction 26           0.34 (SD, 0.09)    1.9 (SD, 0.4)     6
Fraction 27 and 28    0.11 (SD, 0.02)    1.1 (SD, 0.3)     10
Purified μ and λ      0.4                0.84              22
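
The ratio column is simply the NP I50 divided by the NIP I50. The Python sketch below recomputes it from the rounded mean concentrations printed in Table 2 for the three rows that report standard deviations. The names are illustrative, and because the printed ratios were presumably derived from unrounded measurements, the recomputed values need not match the table exactly.

```python
# (NIP I50, NP I50) in micromolar, rounded means as printed in Table 2.
I50_UM = {
    "B1-8 IgM": (0.13, 3.7),
    "Fraction 26": (0.34, 1.9),
    "Fraction 27 and 28": (0.11, 1.1),
}


def np_to_nip_ratio(nip_i50, np_i50):
    """Ratio of NP to NIP concentration at 50% inhibition."""
    if nip_i50 <= 0:
        raise ValueError("NIP I50 must be positive")
    return np_i50 / nip_i50


ratios = {name: np_to_nip_ratio(*values) for name, values in I50_UM.items()}
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.1f}")
```

A ratio well above 1 means more NP than NIP is needed for the same inhibition, which is the heteroclitic behaviour described in the text.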



It has thus been shown that not only is it possible to express μ and λ proteins having intact
variable domains, but also that it is possible to express both μ and λ proteins on compatible
plasmids in the same cell. This latter has not hitherto been disclosed or even suggested.
Moreover, it proved possible to derive functional Ig molecules from E. coli cells which expressed
both μ and λ proteins.

It has thus been shown that the process of the present invention produces a functional Ig
molecule by recombinant DNA techniques.

Construction of Expression Plasmids for Yeast

The plasmids used for expression in yeast (Saccharomyces cerevisiae) are based on pBR322
and the yeast 2 micron plasmid. These plasmids are the subject of copending European Patent
Application EP 0 073 635, the disclosure of which is incorporated herein by reference. The yeast
phosphoglycerate kinase gene (PGK) provides the 5' sequence necessary for initiation of
transcription and insertion of genes can be made at the unique Bgl II site in the plasmid pMA
3013 described in the above mentioned European patent application. This plasmid has now
been renamed pMA91 and is so referred to hereinafter.

a) met-mu (μ)

Bgl II and Bcl I produce compatible 5' GATC ends. The met-μ gene from pNP2 was excised on
a partial Bcl I fragment and ligated into the unique Bgl II site of pMA91.
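
The compatibility of these ends can be illustrated with a short Python sketch. The recognition sequences and cut positions used (Bgl II A^GATCT, Bcl I T^GATCA, Bam HI G^GATCC) are standard values supplied here rather than taken from this text, and the helper names are illustrative.

```python
# Palindromic recognition sites with the number of bases 5' of the cut on each strand.
ENZYMES = {
    "BglII": ("AGATCT", 1),
    "BclI": ("TGATCA", 1),
    "BamHI": ("GGATCC", 1),
}


def five_prime_overhang(site, cut):
    """Single-stranded 5' extension left after cutting a palindromic site."""
    return site[cut:len(site) - cut]


def compatible(enzyme_a, enzyme_b):
    """True if the two enzymes leave identical, hence ligatable, overhangs."""
    return (five_prime_overhang(*ENZYMES[enzyme_a])
            == five_prime_overhang(*ENZYMES[enzyme_b]))


print(five_prime_overhang(*ENZYMES["BglII"]), compatible("BglII", "BclI"))
```

Note that a Bgl II end joined to a Bcl I end generally regenerates neither site, which matters when planning later digests.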

b) Pre-mu (μ)

The plasmid pCT54 μ containing the full length pre-μ cDNA was digested with Hind III. This
cuts at the 3' side of the μ cDNA. This Hind III site was changed to a Bcl I site by incubation
with T4 DNA polymerase in the presence of all 4 nucleotides. The linker R107 of sequence
TTTTGATCAAAA, which contains an internal Bcl I site, was ligated to the DNA obtained above. After
ligation one aliquot of the resultant DNA was digested with Bcl I and Acc I and the μ Acc I-Bcl I
fragment was isolated from the resultant mixture by gel electrophoresis. A separate aliquot of
the resultant DNA was digested with Mbo II and ligated with chemically synthesised oligonucleotide
linkers R121 and R112 which have cohesive Bcl I and Mbo II ends.

R121 5' GATCAATGGGATGGAGCTGT 3' 
R112 5' TTACCCTACCTCGAC 3' 

The ligation reaction was terminated and the resultant DNA digested with AccI to give a μ
MboII-AccI fragment which was isolated from the resultant mixture by gel electrophoresis on a
5% polyacrylamide gel.
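The linker and adaptor sequences quoted above can be sanity-checked with a short sketch. It tests whether R107 is self-complementary and carries the BclI site TGATCA, and it pairs R121 with R112. One assumption should be flagged: R112 as printed is the base-by-base complement of nucleotides 5 to 19 of R121, so the sketch treats it as drawn aligned beneath R121 (reading 3' to 5' from left to right). That reading is an inference from the sequences themselves and is not stated in the text. All helper names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def complement(seq):
    """Base-by-base complement, without reversing."""
    return seq.translate(COMPLEMENT)

def reverse_complement(seq):
    """Complement read in the opposite direction."""
    return complement(seq)[::-1]

R107 = "TTTTGATCAAAA"
R121 = "GATCAATGGGATGGAGCTGT"
R112_AS_PRINTED = "TTACCCTACCTCGAC"  # assumption: drawn 3'->5' beneath R121
BCLI_SITE = "TGATCA"

def aligned_offset(top, bottom_as_drawn):
    """Offset in top at which bottom_as_drawn pairs with it, or -1 if none."""
    return top.find(complement(bottom_as_drawn))

if __name__ == "__main__":
    print("R107 self-complementary:", R107 == reverse_complement(R107))
    print("R107 contains BclI site:", BCLI_SITE in R107)
    offset = aligned_offset(R121, R112_AS_PRINTED)
    paired_end = offset + len(R112_AS_PRINTED)
    print("5' extension of duplex:", R121[:offset])
    print("3' extension of duplex:", R121[paired_end:])
```

Under that reading the annealed pair carries a 5' GATC extension, which matches a BclI end, and a single-base 3' extension, which is the type of end MboII is generally described as leaving.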

The two μ fragments obtained above were ligated together and then digested with BclI to
eliminate unwanted ligation products, after which the resultant pre-μ DNA was ligated to BglII
cut plasmid pMA91.

The resultant ligation mix was used to transform E. coli strain HB101 to ampicillin resistance.
A colony of the transformed bacteria was isolated and was found to contain a plasmid exhibiting
the predicted enzyme digestion pattern. The 5' end of the inserted DNA from this plasmid was
sequenced and shown to have the anticipated sequence for a pre-μ cDNA.

c) met-lambda (λ)

The met-λ DNA was cloned into the BglII site of plasmid pMA91. Plasmid pCT54 Clone 1
was cut with Hind III to the 3' side of the met-λ cDNA and this cleavage site was altered to a
BclI site in a similar manner as previously described for pNP2. Following this the plasmid DNA
was cut 5' of the λ gene at the BclI site and the resultant λ DNA was cloned into the BglII site
of pMA91.

d) Pre-lambda (λ)

Pre-λ cDNA was reconstructed into pMA91 as follows. The plasmid pCT54 was digested with
BclI and Hind III and the resultant vector DNA was isolated by gel electrophoresis and ligated
together with two synthetic oligonucleotides, R162 and R163, and the two fragments from
plasmid pATλ1-15.

R162 5' GATCAATGGCCTGGATT 3' 
R163 5' GTGAAATCCAGGCCATT 3'

These fragments were produced by digesting pATλ1-15 with FokI and Hind III and isolating
the 5' 300 base pair FokI fragment and the 3' 600 base pair FokI-Hind III fragment by gel
electrophoresis. The resultant ligation mixture was used to transform E. coli strain HB101 to
ampicillin resistance and a colony of bacteria containing a plasmid having the predicted correct
restriction enzyme digestion pattern was isolated. The 5' end of the inserted DNA was
sequenced and found to have the anticipated sequence of a pre-λ cDNA. This plasmid was
digested with Hind III and the resultant Hind III cleaved ends converted to a BclI site by
blunting with T4 polymerase and ligating with the linker R107. The resultant plasmid was then
digested with BclI and the pre-λ cDNA isolated on a BclI fragment by gel electrophoresis. This
BclI fragment was then ligated to BglII cut pMA91 to give a pMA91 plasmid containing the full
length pre-λ cDNA.
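How the synthetic oligonucleotides R162 and R163 bridge the BclI-cut vector and the 5' FokI fragment can be sketched as follows. The code anneals the two oligonucleotides, both taken as written 5' to 3', and reports the unpaired extension at each end together with the position of the first ATG in R162. The function name and the annealing rule (longest perfect overlap) are assumptions made for illustration only.

```python
def reverse_complement(seq):
    """Complement of seq read in the opposite direction."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

# Sequences as quoted in the text; whitespace in the printed form, if any, is ignored.
R162 = "GATCAATGGCCTGGATT"
R163 = "GTG AA ATCCAG G CC ATT".replace(" ", "")

def duplex_overhangs(top, bottom):
    """Anneal two 5'->3' oligos by their longest perfect overlap.

    Returns (5' extension of top, 5' extension of bottom, paired region on top).
    """
    bottom_rc = reverse_complement(bottom)  # bottom strand in top-strand orientation
    for k in range(min(len(top), len(bottom)), 0, -1):
        if top[-k:] == bottom_rc[:k]:
            return top[:-k], bottom[:len(bottom) - k], top[-k:]
    return top, bottom, ""  # no pairing found

if __name__ == "__main__":
    top_ext, bottom_ext, paired = duplex_overhangs(R162, R163)
    print("R162 5' extension:", top_ext)
    print("R163 5' extension:", bottom_ext)
    print("paired region:", paired)
    print("first ATG in R162 at index:", R162.find("ATG"))
```

The GATC extension is compatible with a BclI end, and the four-base extension on R163 is consistent with the four-base 5' overhang that FokI is generally described as producing, so the pair would be expected to join the vector to the 5' FokI fragment while supplying the initiator ATG.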

Expression of Immunoglobulin Genes in Yeast

The pMA91 derivative plasmids containing the pre-λ and pre-μ genes as prepared above were
used to transform Saccharomyces cerevisiae yeast host organisms and the pre-λ and pre-μ genes
were expressed by the transformed cells.

Saccharomyces cerevisiae strain MD46 when transformed with pMA91 pre-λ or pMA91 pre-μ
gave rise to colonies which expressed immunoreactive proteins as revealed on Western blots.
Yeast cells containing plasmid pMA91 pre-λ produced a protein which reacted with anti-λ
antiserum and co-migrated on polyacrylamide gels with bacterially synthesised mature λ.
Similarly yeast cells containing pMA91 pre-μ produced a protein which reacted with anti-μ
antiserum and co-migrated on polyacrylamide gels with bacterially synthesised mature μ protein.
These observations indicate that both pre-λ and pre-μ are being processed to the corresponding
mature proteins within the yeast host cell environment.

Additionally the μ protein product (but not the λ protein) was shown to be glycosylated. Cells
were incubated in the presence or absence of tunicamycin at a concentration of 15 μg/ml. This
compound specifically arrests the N-linked glycosylation of proteins. Cell extracts derived from
cells incubated in the absence of tunicamycin showed higher molecular weight bands (as
revealed by Western blotting) whilst extracts from cells incubated in the presence of tunicamycin
showed no such higher molecular weight bands.

When cells were lysed with glass beads and the soluble and insoluble fractions examined it
was found that the μ and λ proteins were exclusively in the insoluble fractions, as determined by
Western blotting.

Secretion 

After incubation transformed yeast cells were spun down and 1 ml volumes of supernatant
were removed and passed through BSA-coated 0.2 μm Millipore filters to remove any remaining
cellular material. ELISA assays for μ and λ protein were then carried out on the filtered
supernatants. Only supernatants from pMA91 pre-λ harbouring cells showed detectable levels of
immunoglobulin protein. Increased amounts of λ protein were detected in the supernatant when
cells were grown in minimal medium to OD 660 = 0.2 and then spun down and resuspended in
YPAB and harvested at OD 660 = 1.0-1.5. Thus the λ protein but not the μ protein is secreted
from yeast cells.

Intracellular Location 

After incubation cells containing pMA91 pre-μ were converted to spheroplasts and fixed in
5% acetic acid/95% ethanol (v/v) and incubated with fluorescein conjugated goat anti-mouse μ.
The fixed cells were examined by fluorescence microscopy and the μ protein was found to be
localised in the periphery of the spheroplasts and especially in vacuoles.

Expression of both μ and λ Proteins in the Same Cells

In order to express both genes in the same host cells it was necessary to provide compatible
plasmids for use in transformation. The pre-λ gene was excised from pMA91 pre-λ on a Hind III
fragment and inserted into the Hind III site of plasmid pLG89 (Gritz L. and Davies J., Gene, 25,
179-188, 1983). This plasmid contains a ura3 marker which can be used as a positive
selection for transformed host cells. A convenient ura3- host organism is S. cerevisiae strain
X4003-5B. Both plasmids, pMA91 pre-μ and pLG89 pre-λ, were transformed into this strain,
and colonies were grown which contained together both the ura3 and leu2 markers. After
incubation of the transformed cells both λ and μ proteins together were detected in the same
cultures of X4003-5B using ELISA techniques. Levels of expression were comparable to those
obtained for the individual genes in MD46 cells and for the λ gene alone in X4003-5B.

In addition, to check for assembly of the λ and μ proteins in vivo the following procedure was
followed.

After growth to an OD 660 ≈ 1, the transformed X4003-5B cells were spun down and
resuspended in buffer, either 5 mM borate buffer at pH 8.0 or phosphate buffer at pH 5.8,
either with or without detergent (e.g. 0.5% Triton X). The suspended cells were then lysed by
vortexing with glass beads in buffer as above and the insoluble material was spun down. After
centrifugation aliquots of the supernatant were assayed in a NIP binding assay (as described
previously for assembled E. coli μ and λ proteins). Specific antigen binding activity was detected
in the supernatant and this activity was shown to be specifically competed out by free NIP (the
specific antigen). These results indicated that μ and λ proteins were expressed within the X4003-5B
cells and assembled into functional immunoglobulin molecules in vivo.

In the above experiments the culture medium used for incubating the transformed cells was
yeast minimal medium (containing, per litre, 6.7 g DIFCO yeast nitrogen base without amino
acids, 10 g glucose, and 200 mg each of histidine, tryptophan, methionine and adenine). When
the λ protein alone was being expressed the medium contained in addition 200 mg/l of leucine,
and when the μ protein alone was being expressed the medium contained in addition 200 mg/l
of uracil.
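The supplement rule in the preceding paragraph can be written out as a small lookup. This is an illustrative sketch only: the dictionary layout and the function name are invented here, while the quantities are those quoted in the text. It also assumes that the base recipe, without further supplement, is what was used when both chains were expressed together.

```python
# Per-litre composition of the yeast minimal medium as quoted in the text.
BASE_MEDIUM = {
    "yeast nitrogen base without amino acids (g)": 6.7,
    "glucose (g)": 10,
    "histidine (mg)": 200,
    "tryptophan (mg)": 200,
    "methionine (mg)": 200,
    "adenine (mg)": 200,
}

# Extra supplement (mg per litre) added when only one chain is expressed.
SINGLE_CHAIN_SUPPLEMENT = {
    "lambda": ("leucine (mg)", 200),
    "mu": ("uracil (mg)", 200),
}

def medium_for(chains):
    """Return the per-litre recipe for cells expressing the given chain(s)."""
    chains = set(chains)
    unknown = chains - set(SINGLE_CHAIN_SUPPLEMENT)
    if unknown or not chains:
        raise ValueError(f"expected 'mu' and/or 'lambda', got {sorted(chains)}")
    recipe = dict(BASE_MEDIUM)
    if len(chains) == 1:
        # Only one plasmid is present, so the other auxotrophy must be fed.
        name, amount = SINGLE_CHAIN_SUPPLEMENT[next(iter(chains))]
        recipe[name] = amount
    return recipe

if __name__ == "__main__":
    for combo in (["lambda"], ["mu"], ["mu", "lambda"]):
        extras = {k: v for k, v in medium_for(combo).items() if k not in BASE_MEDIUM}
        print(sorted(combo), "extra supplements:", extras)
```

The rule is consistent with the markers described above: cells carrying only the ura3 plasmid (pre-λ) would still require leucine, and cells carrying only the plasmid presumed to bear leu2 (pre-μ) would still require uracil.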

CLAIMS: 

1. A process for producing a heterologous multi chain polypeptide or protein in a single host
cell, which comprises transforming the host cell with DNA sequences coding for each of the
polypeptide chains and expressing said polypeptide chains in said transformed host cells.

2. A process according to claim 1, wherein each DNA sequence is inserted individually into
a vector and the individual vectors are used to transform the single host cell.

3. A process according to claim 1, wherein at least two of the DNA sequences are inserted
into separate parts of a vector which is used to transform the single host cell.

4. A process according to claim 2 or claim 3 wherein the or each vector is a plasmid.

5. A process according to claim 4, wherein the plasmid is col E1, pcR1, pBR322, RP4 or
phage DNA.

6. A process according to any one of claims 1 to 5 wherein the host cell is a bacterium or a
yeast.

7. A process according to claim 6, wherein the host cell is E. coli, B. subtilis or S. cerevisiae.

8. A process according to claim 7, wherein the host cell is E. coli strain HB101, X1776,
X2887, PS410, MRC 1, RV308, B or E1035.

9. A process according to any one of claims 1 to 8, wherein the polypeptide chains are
expressed and secreted by the host cell.

10. A process according to any one of claims 1 to 8, wherein the polypeptide chains
expressed by the host cell are solubilised under conditions which facilitate assembly of the active
multi chain polypeptide or protein.

11. A process according to any one of claims 1 to 10, for producing an Ig molecule or an
immunologically functional Ig fragment, wherein the host cell is transformed with DNA
sequences coding for at least the variable domains of the Ig heavy and light chains.

12. A process according to claim 11, wherein the DNA sequences code for the complete Ig
heavy and light chains.

13. A process according to claim 11 or claim 12, for producing an Ig molecule or fragment
having at least one constant domain, wherein the constant domain is derived from the same
source as the variable domain to which it is attached.

14. A process according to claim 11 or claim 12, for producing an Ig molecule or fragment
having at least one constant domain, wherein the constant domain is derived from a different
source from that from which the variable domain to which it is attached is derived.

15. A process according to any one of claims 11 to 14, wherein the DNA coding sequences
for the Ig molecule or fragment are derived from one or more monoclonal antibody producing
hybridomas.

16. A process for increasing the level of protein expression in a transformed host cell
comprising altering the vector DNA sequence in the region of the initiator ATG codon to one
which substantially eliminates RNA secondary structure.

17. A process according to claim 16 wherein the DNA sequence is altered to minimise the
absolute value of the ΔG value of the sequence.

18. A heterologous multi chain polypeptide or protein produced by recombinant DNA
technology from a single host cell.

19. An Ig molecule or an immunologically functional Ig fragment produced by recombinant
DNA technology from a single host cell.

20. An Ig molecule or fragment according to claim 19, including at least one constant
domain derived from the same source as the variable domain to which it is attached.

21. An Ig molecule or fragment according to claim 19, including at least one constant
domain derived from a source different from the source from which the variable domain to
which it is attached was derived.

22. An Ig fragment according to claim 20 or claim 21, comprising only one constant
domain attached to each variable domain.

23. An Ig molecule according to claim 20 or claim 21, comprising sufficient constant
domains attached to each variable domain to produce a complete Ig molecule.

24. An Ig heavy or light chain or a fragment thereof having a complete variable domain, as
a product of recombinant DNA technology.

25. A vector for use in a process according to any one of claims 1 to 17.

26. A vector for use in transforming a host cell to enable it to produce a molecule according
to any one of claims 18 to 24.

27. A vector according to claim 25 or claim 26, which is a plasmid.

28. A host cell transformed with a vector according to any one of claims 25 to 27.

29. A therapeutic composition comprising a molecule according to any one of claims 18 to
24 and a pharmaceutically acceptable carrier.

30. A process for preparing a composition according to claim 29, comprising mixing a
molecule according to any one of claims 18 to 24 with a pharmaceutically acceptable carrier.

31. An affinity chromatography medium comprising a molecule according to any one of
claims 18 to 24 immobilised on a chromatographic medium.

32. An immunological assay method in which the or each antigen binding Ig molecule or
fragment is a molecule or fragment according to any one of claims 19 to 24.

33. An Ig molecule or fragment according to any one of claims 18 to 24 having attached
thereto a cytotoxic, enzymic or structural polypeptide moiety synthesised in the cell with the Ig
molecule or fragment.

34. A process for producing a heterologous multichain polypeptide or protein in a single
host cell, substantially as hereinbefore described.

35. A process for increasing the level of protein expression in a transformed host cell
substantially as hereinbefore described.

36. A molecule according to any one of claims 18 to 24 substantially as hereinbefore
described.

37. A vector according to claim 25 or claim 26 substantially as hereinbefore described.

38. A host cell according to claim 28, substantially as hereinbefore described.